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COMPOSITIONS AND METHODS FOR ALTERING 
AMINO ACID CONTENT OF PROTEINS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit of U.S. Application Serial No. 08/988,015, 
filed December 10, 1997, which is herein incorporated by reference. 

5 

FIELD OF THE INVENTION 
The invention relates to a process for the production of proteins having high 
nutritional properties. The methods find particular use in the production of plants with 
increased levels of amino acids having high nutritional properties through the 
1 0 modification of plant genes. 

BACKGROUND OF THE INVENTION 
Autotrophic organisms can make all of their own amino acids. Other cells utilize 
many preformed amino acids. Humans and other higher animals require a number of 
15 essential amino acids in the diet. These essential amino acids are obtained directly or 
indirectly by eating plants. These essential amino acids include lysine, tryptophan, 
threonine, methionine, phenylalanine, leucine, valine and isoleucine. 

Constructing proteins with higher nutritional value has been a long-sought goal of 
scientists. Traditionally, agricultural scientists concentrated on breeding plants with high 
20 nutritional yield. Typically, these new varieties were richer in carbohydrates but usually 
poorer in essential proteins than the wild type varieties from which they were derived. 
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Seed storage proteins represent up to 90% of total seed protein in seeds of many 
plants. They are used as a source of nutrition for young seedlings in the period 
immediately following germination. The genes encoding them are strictly regulated, 
being expressed in a highly tissue specific and stage specific manner. These genes are 
5 almost exclusively expressed in developing seed. Different classes of seed storage 
proteins may be expressed at different stages in the development of the seed. They are 
typically stored in membrane bound organelles called protein bodies or protein storage 
vacuoles. 

A related group of proteins, the vegetative storage proteins, have similar amino 
10 acid compositions and are also stored in specialized vacuoles. These proteins are 

generally found in leaves instead of seeds. These proteins are degraded upon flowering, 
and are thought to serve as a nutritive source for developing seeds. 

Cereal grains and legume seeds which are key protein sources for the vegetarian 
diet are generally deficient in essential amino acids such as methionine, lysine, and 
15 threonine. Therefore, there is needed means for improving the nutritional quality of these 
proteins. 

SUMMARY OF THE INVENTION 
Compositions and methods for altering the amino acid profiles of proteins without 
20 introducing conformational changes into the protein are provided. The method involves 
preparing a binding partner and/or an interacting molecule which binds to the native 
protein and using such interacting molecule to select for modified proteins retaining the 
native conformation. 

The method finds particular use in altering the nutritional value of proteins. A 
25 plant protein having increased methionine levels is provided. The modified protein 

retains the conformation of the native protein while having significantly higher levels of 
methionine. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 shows VSP homologies. 

VSP-b (same as VSPp) and VSP-a (same as VSPa): Staswick, P.R, (1988), Plant 
Physiol 87, 250-254. 

5 

T.phos (tomato acid phosphatase): Erion, J.L., Ballo, B., May, L., Bussell, J., 
Fox, T.W., & Thomas, S.R., SwissProt database accession number P27061. 

Ph.vulg (Phaseolus vulgaris): Zhon, P-Y., Tanaka, T., Yamauchi, D., & 
10 Minamikawa, T. (1997), Plant Physiol. 113, 479-485. 

Ar.VSP (Arabidopsis thaliana): Yu, D.Y., Quigley, F., & Mache, R., EMBL 
database accession number X79490. 

15 Ar.lA-1, Arl7A-l (Arabidopsis thaliana, floral organs): Utsugi, S., Sakamoto, 

Ogura, Y., Murata, M., & Motoyoshi, F. (1996) Plant Mol. Biol. 32, 759-765. 

Fig. 2 shows proposed VSPp methionine-enriched variants. 
Fig. 3 A shows the hydropathy index computation for sequence VSPp. 
20 Fig. 3B shows the hydropathy index computation for sequence VSPMetlO. 

Fig. 3C shows the hydropathy index computation for sequence VSPMet20. 
Fig. 3D shows the hydropathy index computation for sequence VSPMet30. 
Fig. 4 shows the VSPP-metlO sequence. 

Fig. 5 shows the colony lift assay to detect protein-protein interactions. 

25 

DETAILED DESCRIPTION OF THE INVENTION 
Proteins having altered amino acid profiles are provided. The proteins can be 
designed to be enriched in essential amino acids, including lysine, methionine, 
tryptophan, threonine, phenylalanine, leucine, valine and isoleucine relative to average 
30 levels of such amino acids in the native protein. 
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Generally, knowledge of the three-dimensional (3-D) structure of a given protein 
allows one to engineer amino acid substitutions in a rational manner so as to effect a 
desired change in the property of the protein without compromising the folding process. 
The present invention provides methods for increasing the levels of essential amino acids 
5 within a protein while at the same time the altered protein has the conformation of the 
native protein. 

The present invention provides methods for altering the amino acid content of a 
protein whose 3-D structure is unknown or unavailable. The method may also provide an 
easy method for assessing changes in a protein in which the structure of the protein is 

10 known but tools for confirming conformation of the protein may be unavailable. The 
"conformation" of a protein refers to the spatial arrangement of substituent groups of the 
molecule. The polypeptide chain of a protein has only one conformation (or a very few) 
under normal biological conditions of temperature and pH. This, referred to as the 
"native conformation," confers biological activity. The native conformation is 

15 sufficiently stable so that the protein can be isolated and retained in its native state. 

Therefore, it is important to be able to change the amino acid content of a protein, yet at 
the same time have the protein retain its biological activity. 

The methods of the invention are useful for making amino acid changes within 
proteins whose conformation is unknown or unavailable. Such proteins include the 

20 vegetative storage protein which is believed to play a significant role in supplying amino 
acids for protein deposition during seed fill, and other proteins of the seed. The methods 
of the invention may be used to modify the amino acid composition of any protein. 
Examples of such proteins include but are not limited to wheat endosperm purothionine 
(Mak and Jones (1976) Can J. Biochem. 22:83J); albumins (Higgins et al. (1986) J. Biol. 

25 Chem. 261 : 1 1 124); and methionine rich proteins (Pedersen et al. (1986) J. Biol Chem. 
261:6279; Kirihara et al. (1988) Gene 71:359; Musumura et al. (1989) Plant. Mol. Biol. 
12:123). 

The methods of the invention comprise altering the amino acid composition of a 
protein to produce an engineered protein. The engineered protein will retain the 
30 conformation and activity of the native protein yet have a modified or altered amino acid 
content. In this manner, levels of particular amino acids of interest can be increased or 
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decreased. Of particular interest, is to increase the levels or numbers of essential amino 
acids in the proteins. By essential amino acid is intended, lysine, tryptophan, threonine, 
methionine, phenylalanine, leucine, valine, isoleucine, and cysteine. However, it is 
recognized that the amino acid composition can be changed in various ways, as long as 
5 the changes do not affect the conformation of the final protein. 

The proteins of the invention have been engineered or modified to contain altered 
amino acid levels. The engineered protein retains the conformation of the native protein. 
The method involves preparing binding partners and/or interacting molecules to the 
native protein and utilizing these interacting molecules to determine whether the 

10 engineered protein folds correctly. By "binding partner" or "interacting molecule" is 
intended a molecule which is capable of binding or interacting with the proteins of 
interest. Such binding partners or interacting molecules include antibodies, monoclonal 
antibodies, antibody fragments, proteins, modified proteins, nucleotide sequences, such 
as aptomers, chemical compounds (e.g. carbohydrates, etc.), or combinations thereof. 

15 The interacting molecules also encompass polypeptides that have an intrinsic affinity to 
the protein of interest, particularly such polypeptides that are capable of binding with the 
protein of interest to form an oligomeric complex. For example, VSP-alpha binds VSP 
with high affinity and could be used as an interacting molecule for the altered VSP 
protein. 

20 Methods for antibody production are known in the art. See, for example, 

Antibodies, A Laboratory Manual, Harlow and Lane (Eds.), Cold spring Harbor 
Laboratory Press, Coldspring Harbor, NY (1988), and the references cited therein. See 
also, Radka et al. (1983) J. Immunol. 128:2804; and Radka et al. (1984) Immunogenetics 
19:63. All of which are herein incorporated by reference. 

25 Once antibodies, preferably monoclonal antibodies, are available which bind to 

the native protein, such antibodies can be used to select for modified proteins which 
retain the conformation of the native protein. Strategies to identify residues within a 
protein that might tolerate amino acid substitution include mutational analysis, secondary 
structure prediction, homology comparison, and the like. Such strategies can be used to 

30 identify amino acids within the protein that will tolerate amino acid substitution. 
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By mutational analysis is intended mutagenic PCR and DNA shuffling. See, for 
example, Stemmer, W.P. (1994) Nature 370:389-391; and Stemmer, W.P. (1994) Proc. 
Natl. Acad. Sci. USA 91:10747-10751, herein incorporated by reference. Such methods 
can be used to generate phage display libraries of protein genes containing random 
5 mutations. Phage display is an in vitro selection technology which allows for a foreign 
protein or peptide to be displayed on the surface of filamentous phage, linking the 
phenotype of the phage to its genotype. Molecular repertoires with sufficient diversity 
can be generated using such technology. Proteins which exhibit the correct 
conformation, that is, the native conformation, can be selected for by the ability to bind 

10 antibodies recognizing conformational domains of the native protein. See also Methods 
in Enzymology, Combinatorial Chemistry, JohnN. Abelson (Ed.), Vol 267, Academic 
Press, Inc., San Diego, CA, herein incorporated by reference. Once correctly-folded 
protein variants are determined, subsequent isolation and sequencing of the variants 
reveals the tolerated sites for mutations. Alternatively, correctly folded variants may be 

15 identified by other screening or selection methods such as filter lift-assay and ELISA. 

Substitutions may also be incorporated at secondary structure prediction sites. 
Structural features of the protein are important for proper folding. Sequence analysis 
tools such as the GCG (Wisconsin sequence analysis package, Genetics Computer Group, 
University Research Park, 575 Science Drive, Madison, WI) and PC/GENE (Oxford 

20 Molecular Group, 2105 S. Bascom Avenue, Suite 200, Campbell, CA) can be used to 
analyze protein sequence for secondary structure features such as helices, sheets and 
turns. In this manner, it can be determined whether a particular stretch of amino acids 
may reside on the surface of the protein. Residues on the surface of a protein tolerate 
substitution more readily than buried residues without compromising the structure of the 

25 protein. Utilizing these algorithms, predicted turns and surface regions of the proteins 
can be made. Therefore, predictions can be made into which regions amino acid 
substitutions can be made without affecting conformation. 

Sites for amino acid substitution can also be determined by homology comparison 
to other proteins. Nature has tested the tolerance of protein residues to substitution as 

30 exemplified in the sequences of proteins such as globins and cytochromes from several 
different species, members of which have the same fold. See, for example, Hampsey et 
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al. (1988) FEBS Lett. 231:275; Bashford et al. (1987) J. Mol Biol. 196:199; Lesk and 

Chothia (1980) J. Mol. Biol. 136:225. 

In designing proteins of the invention, hydrophobic residues, such as alanine, 

cysteine, valine, isoleucine, leucine, methionine, phenylalanine, and tryptophan may be 
5 substituted for one another without undue perturbation of the structure. Such residues 

generally occur in the hydrophobic core of the protein. See, Bowie et al. (1990) Science 

247:1306-1310; and Baldwin and Matthews (1994) Curr. Opin. Biotech 5:396-402. See 

also Ladunga and Smith (1997) Prot Eng. 10:187-196, herein incorporated by reference. 

Generally, residues that substitute for one another in related sequences do so by 
10 conserving the physico-chemical properties of the residue and folding of the protein thus 

conserving the 3-D structure of the protein. 

Therefore, the protein to be modified can be compared with homologous proteins. 

Amino acids that are critical to the function and/or folding of the protein would be 

expected to be conserved over time. Therefore, predictions can be made as to which 
15 amino acids can be substituted without affecting the conformation or folding of the 

protein. 

Such selected amino acid substitutions can be made by DNA sequencing, site- 
directed mutagenesis, or other methods which substitute one amino acid with any other 
amino acid. 

20 Once the amino acid substitutions have been made and the conformation 

confirmed by antibody binding, the protein can be expressed using known expression 
systems. Where necessary, the DNA encoding the protein can be synthesized using 
known techniques. Likewise, the nucleotide sequence encoding the protein can be 
contained within expression cassettes. 

25 Utilizing the methods of the invention, proteins can be constructed which have 

increased nutritional quality. That is, the essential amino acid content within the protein 
can be increased to represent at least about 5 -about 10%, preferably at least about 10- 
about 20%, more preferably at least about 20-about 40% of the total amino acid content 
in the protein. 

30 In the same manner, the amino acid content of a subject protein can be altered to 

include at least about 10% amino acid substitutions, additions or deletions, about 20% or 
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even up to about 30% to about 40%. It is recognized that the limitation will be the 
activity of the altered. The present invention provides a convenient and ready mechanism 
to test the activity of the protein by its ability to bind the interacting molecule. 

For convenience for expression in plants, the nucleic acid encoding the modified 
peptides or proteins of the invention can be contained within expression cassettes. The 
expression cassette will comprise a transcriptional initiation region linked to the nucleic 
acid encoding the peptide of interest. Such an expression cassette is provided with a 
plurality of restriction sites for insertion of the gene or genes of interest to be under the 
transcriptional regulation of the regulatory regions. 

The transcriptional initiation region, the promoter, may be native or homologous 
or foreign or heterologous to the host, or could be the natural sequence or a synthetic 
sequence. By foreign is intended that the transcriptional initiation region is not found in 
the wild-type host into which the transcriptional initiation region is introduced. 

The transcriptional cassette will include the in 5 D-3 □ direction of transcription, a 
transcriptional and translational initiation region, a DNA sequence of interest, and a 
transcriptional and translational termination region functional in plants. The termination 
region may be native with the transcriptional initiation region, may be native with the 
DNA sequence of interest, or may be derived from another source. Convenient 
termination regions are available from the Ti-plasmid of A. tumefaciens, such as the 
octopine synthase and nopaline synthase termination regions. See also, Guerineau et al, 
(1991) Mol. Gen. Genet. 262:141-144; Proudfoot (1991) Cell 64:671-674; Sanfacon et al. 
(1991) Genes Dev. 5:141-149; Mogen et al. (1990) Plant Cell 2:1261-1272; Munroe et al. 
(1990) Gene 91:151-158; Ballas et al. 1989) Nucleic Acids Res. 17:7891-7903; Joshi et 
al. (1987) Nucleic Acid Res. 15:9627-9639. 

Where appropriate, the gene(s) expressing the modified proteins may be 
optimized for increased expression in the transformed plant. In this manner, the 
sequences can be synthesized using monocot, dicot or particular plant; i.e. maize, 
soybean, sorghum, wheat, etc., preferred codons for improved expression. Methods are 
available in the art for synthesizing plant preferred genes. See, for example, U.S. Patent 
Nos. 5,380,831, 5,436, 391, and Murray et al. (1989) Nucleic Acids Res. 17:477-498, 
herein incorporated by reference. 
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The expression cassettes may additionally contain 5' leader sequences in the 
expression cassette construct. Such leader sequences can act to enhance translation. 
Translation leaders are known in the art and include: picornavirus leaders, for example, 
EMCV leader (Encephalomyocarditis 5' noncoding region) (Elroy-Stein, 0., Fuerst, T.R, 
5 and Moss, B. (1989) PNAS USA, 86:6126-6130); potyvirus leaders, for example, TEV 
leader (Tobacco Etch Virus) (Allison et al. (1986); MDMV leader (Maize Dwarf Mosaic 
Virus); Virology, 154:9-20), and human immunoglobulin heavy-chain binding protein 
(BiP), (Macejak, D.G., and P. Sarnow (1991) Nature, 353:90-94, untranslated leader from 
the coat protein mRNA of alfalfa mosaic virus (AMV RNA 4), (Jobling, S.A., and 

10 Gehrke, L., (1987) Nature, 325:622-625; tobacco mosaic virus leader (TMV), (Gallie, 

DR. et al. (1989) Molecular Biology of RNA, pages 237-256; and maize chlorotic mottle 
virus leader (MCMV) (Lommel, SA et al (1991) Virology, 81:382-385). See also, 
Della-Cioppa et al. (1987) Plant Physiology, 84:965-968. Other methods known to 
enhance translation can also be utilized, for example, introns, and the like. 

15 The expression cassettes may contain one or more than one nucleic acid 

sequences to be transferred and expressed in the transformed plant. Thus, each nucleic 
acid sequence will be operably linked to 5' and 3' regulatory sequences. Alternatively, 
multiple expression cassettes may be provided. 

Generally, the expression cassette will comprise a selectable marker gene for the 

20 selection of transformed cells. Selectable marker genes are utilized for the selection of 
transformed cells or tissues. Such selectable marker genes are known in the art. See 
generally, G. T. Yarranton (1992) Curr. Opin. Biotech., 3:506-511; Christopherson et al 
(1992) Proc. Natl. Acad. Sci. USA, 89:6314-6318; Yao et al. (1992) Cell, 71:63-72; W. 
S. Reznikoff (1992) Mol. Microbiol., 6:2419-2422; Barkley et al. (1980) The Operon, pp. 

25 177-220; Hu et al. (1987) Cell, 48:555-566; Brown et al (1987) Cell, 49:603-612; Figge 
et al. (1988) Cell, 52:713-722; Deuschle et al. (1989) Proc. Natl. Acad. Aci. USA, 
86:5400-5404; Fuerst et al. (1989) Proc. Natl. Acad. Sci. USA, 86:2549-2553; Deuschle 
et al (1990) Science, 248:480-483; M. Gossen (1993) PhD Thesis, University of 
Heidelberg; Reines et al. (1993) Proc. Natl Acad. Sci. USA, 90:1917-1921; Labow et al. 

30 (1990) Mol. Cell Bio., 10:3343-3356; Zambretti et al (1992) Proc. Natl Acad. Sci. USA, 
89:3952-3956; Bairn et al. (1991) Proc. Natl. Acad. Sci. USA, 88:5072-5076; Wyborski 
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et al. (1991) Nuc. Acids Res., 19:4647-4653; A. Hillenand-Wissman (1989) Topics in 
Mol. and Struc. Biol., 10:143-162; Degenkolb et al. (1991) Antimicrob. Agents 
Chemother., 35:1591-1595; Kleinschnidt et al. (1988) Biochemistry, 27:1094-1104; Gatz 
et al. (1992) Plant J, 2:397-404; A. L. Bonin (1993) PhD Thesis, University of 
5 Heidelberg; Gossen et al. (1992) Proc. Natl. Acad. Sci. USA, 89:5547-5551; Oliva et al. 
(1992) Antimicrob. Agents Chemother., 36:913-919; Hlavka et al. (1985) Handbook of 
Exp. Pharmacology, 78; Gill et al. (1988) Nature 334:721-724; DeBlock et al. (1987) 
EMBO J., 6:2513-2518; DeBlock et al. (1989) Plant Physiol., 91:691-704; Fromm et al. 
(1990) 8:833-839; Gordon-Kamm et al. (1990) 2:603-618. Such disclosures are herein 

10 incorporated by reference. 

The nucleotide sequences of interest of this invention can be introduced into the 
genome of the desired host organism in a variety of techniques known in the art. For the 
purposes of this invention, it will be appreciated to those skilled in the art that any 
conventional transformation vector may be used as long as it is capable of transforming 

15 the organism of choice and it does not have restriction sites in common with those 
comprising the final master insertion cassette. Hence, the detailed experimental 
description of transformation vectors is given by way of illustration only. 

Vector systems are known for the transformation of yeast and bacterial cells. For 
yeast, these include but are not limited to autonomously replicating plasmids (see, for 

20 example, Stearns et al. (1990) Methods Enzymol. 185:280-297); 2-micron circle yeast 
DNA sequences (see, for example, Hollenberg (1982) Curr. Topics Microbiol. Immunol. 
96:119-144; Broach (1983) Methods Enzymol. 101:307-325; MacKay (1983) Methods 
Enzymol. 101:325-343; Armstrong (1989) BioTechnology 13:165-192; Rose (1990) 
Methods Enzymol. 185:234-279); linearized vector DNA (see, for example, see, for 

25 example, Takita et al. (1997) Yeast 13:763-768); artificial chromosome vectors (Burke 
(1987) Science 236:806-812); restriction site bank plasmids (Davison (1987), U.S. Patent 
No. 4,657,858, and Methods Enzymol. 153: 34-54); delta-integration vectors (see, for 
example, Lee and Da Silva (1997) Biotechnol. Prog. 13:368-373); and Agrobacterium- 
based vectors (see, for example, Bundock et al. (1995) EMBO J. 14:3206-3214; Piers et 

30 al. (1996) Proc. Natl. Acad. Sci. USA 93:1613-1618; Risseeuw et al. (1996) Mol. Cell. 
Biol. 16:5924-5932); and Shuttle Vectors (see, for example, Schneider (1991) Methods 
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Enzymol 194:373-388; Singh (1997) Methods Mol. Biol. 62:113-130). See generally 
Hinnen (1980) Curr. Topics Microbiol. Immunol. 96:101-117; Nombela (1985) Revis. 
Biol. Cel. 4:1-25; Parent (1985) Yeast 1(2):83-138; West (1988) BioTechnology 10:387- 
404; Schena (1991) Methods Enzymol. 194:389-398; Schneider (1991) Methods 
5 Enzymol. 194:373-388; and Singh (1997) Methods. Mol. Biol. 62:1 13-130. 

Vector systems used for bacterial transformation include, but are not limited to, 
yeast shuttle vectors (see, for example, Ward (1990) Nucleic Acids Res. 18(17):53 19; 
Strathern (1991) Methods Enzymol. 194:319-329; Soni (1992) Nucleic Acids Res. 20(21) 
5852; Nacken (1994) Nucleic Acids Re. 22:1509-1510; Wehmeier (1995) Gene 165:149- 

10 150); pBR322 and related plasmids such as pBR327 and pKC7 (see, for example, Rao 
and Rogers (1979) Gene 7:79-82; Talmadge and Gilbert (1980) Gene 12:235-241; Smith 
et al. (1995) Microbiology 141(pt. 1): 181-188); pATH vectors (see, for example, Koerner 
et al. (1991) Methods in Enzymol. 194:477-490); yeast plasmids (see, for example, 
Marcil (1992) Nucleic Acids Res. 20:917); and natural replicon ColEI and related 

15 plasmids such as P15A, F, RSF1010, and R616 (see, for example, Muhlenhoff and 
Chauvat (1996) Mol. Gen. Genet. 252:93-100; Sakai and Komano (1996) Biosci. 
Biotechnol. Biochem. 60:377-382; Lee and Henk (1997) Vet. Microbiol. 54:369-374); 
herein incorporated by reference. 

A number of vector systems are also known for the introduction of foreign or 

20 native genes into mammalian cells. These include SV40 virus (see, for example, 

Okayama et al. (1985) Molec. Cell. Biol. 5:1136-1142); Bovine papillomavirus (see, for 
example, DiMaio et al. (1982) Proc. Natl. Acad. Sci. USA 79:4030-4034); adenovirus 
(see, for example, Morin et al. (1987) Proc. Natl. Acad. Sci. USA 84:4626; Yifan et al. 
(1995) Proc. Natl. Acad. Sci. USA 92: 1401-1405; Yang et al. (1996) Gene Ther. 3:137- 

25 144; Tripathy et al. (1996) Nat. Med. 2:545-550; Quantin et al. (1992) Proc. Natl. Acad. 
Sci. USA 89:2581-2584; Rosenfeld et al. (1991) Science 252:431-434; Wagner (1992) 
Proc. Natl. Acad. Sci. USA 89:6099-6103; Curiel et al. (1992) Human Gene Therapy 
3:147-154; Curiel (1991) Proc. Natl. Acad. Sci. USA 88:8850-8854; LeGal LaSalle et al. 
(1993) Science 259:590-599); Kass-Eisler et al. (1993) Proc. Natl. Acad. Sci. USA 

30 90: 1 1498-1 1502); adeno-associated virus (see, for example, Muzyczka et al. (1994) J. 
Clin. Invest. 94:1351; Xiao et al. (1996) J. Virol. 70:8098-8108); herpes simplex virus 
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(see, for example, Geller et al. (1988) Science 241:1667; Huard et al. (1995) Gene 
Therapy 2:385-392; U.S. Patent No. 5,501,979); retrovirus-based vectors (see, for 
example, Curran et al. (1982) J. Virol., 44:674-682; Gazit et al. (1986) J. Virol., 60:19- 
28; Miller (1992) Curr. Top. Microbiol. Immunol. 158:1-24; Cavanaugh et al. (1994) 
5 Proc. Natl. Acad. Sci. USA 91 :7071-7075; Smith et al. (1990) Mol. Cell. Biol. 10:3268- 
3271); herein incorporated by reference. 

Methods of the present invention can be used to facilitate assembly of nucleotide 
sequences of interest for transformation of any plant. In this manner, genetically 
modified plants, plant cells, plant tissue, seed, and the like can be obtained. The 

10 transformation vector and hence method of transformation chosen will depend on the 
type of plant or plant cell, i.e. monocot or dicot, targeted for transformation. Suitable 
methods of transforming plant cells include microinjection (Crossway et al. (1986) 
Biotechniques 4:320-334); electroporation (Riggs et al. (1986) Proc. Natl. Acad. Sci. 
USA 83:5602-5606); Agrobacterium-mediated transformation (Hinchee et al. (1988) 

1 5 Biotechnology 6:915-921); direct gene transfer (Paszkowski et al. (1984) EMBO J. 
3 :2717-2722); and ballistic particle acceleration (see, for example, Sanford et al. U.S. 
Patent 4,945,050; W09 1/10725 and McCabe et al. (1988) Biotechnology 6:923-926). 
Also see, Weissinger et al. (1988) Annual Rev. Genet. 22:421-477; Sanford et al. (1987) 
Particulate Science and Technology 5:27-37 (onion); Christou et al. (1988) Plant Physiol. 

20 87:671-674 (soybean); McCabe et al. (1988) BioTechnology 6:923-926 (soybean); Datta 
et al. (1990) Biotechnology 8:736-740 (rice); Klein et al. (1988) Proc. Natl. Acad. Sci. 
USA 85:4305-4309 (maize); Klein et al. (1988) BioTechnology 6:559-563 (maize); 
WO91/10725 (maize); Klein et al. (1988) Plant Physiol. 91:440-444 (maize); Fromm et 
al. (1990) BioTechnology 8:833-839; and Gordon-Kamm et al. (1990) Plant Cell 2:603- 

25 618 (maize); Hooydaas-Van Slogteren and Hooykaas ( 1 984) Nature (London) 3 1 1 : 763- 
764; Bytebier et al. (1987) Proc. Natl. Acad. Sci. USA 84:5345-5349 (Liliaceae); De Wet 
et al. (1985) In The Experimental Manipulation of Ovule Tissues, ed. G.P. Chapman et 
al., pp. 197-209 (Longman, N.Y.) (pollen); Kaeppler et al. (1990) Plant Cell Reports 
9:415_418; and Kaeppler et al. (1992) Theor. Appl. Genet. 84:560-566 (whisker-mediated 

30 transformation); D'Halluin et al. (1992) Plant Cell 4:1495-1505 (electroporation); Li et al. 
(1993) Plant Cell Reports 12:250-255, and Christou and Ford (1995) Annals of Botany 
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75:407-413 (rice); Osjoda et al. (1996) BioTechnology 14:745-750 (maize via 

Agrobacterium tumefaciens); all of which are herein incorporated by reference. 

The following examples are offered by way of illustration and not by way of limitation. 



5 EXPERIMENTAL 

Three complementary strategies, namely, mutational analysis, secondary structure 
prediction, and homology comparison (see below) have been used to identify amino acids 
within VSPp (vegetative storage protein) that might tolerate methionine substitution. 
Together, results from these strategies facilitated the design of three VSP variants with 
10 increasing methionine content. 

1 . Mutational analysis 

The simple premise behind this strategy was that if one prepared monoclonal 
antibodies that recognized the wild-type VSP, then these same antibodies would, if the 

15 mutant proteins folded correctly, also recognize the engineered proteins. As a first step, 
therefore, mice were injected with VSP purified from soybean leaves, and a panel of 21 
monoclonal antibodies recognizing wild-type VSP has been characterized by ELISA. 
These antibodies also recognize VSPa expressed and purified from Pichia pastoris. 
The following two approaches can be implemented to generate either random or "semi- 

20 rational" mutations in VSPp. Mutagenic PCR and DNA shuffling (Stemmer, W.P. 
(1994) Nature 370, 389-391; Stemmer, W.P. (1994) Proc. Natl Acad. Sci. USA 91, 
10747-10751) can be used to generate phage display libraries of VSPp genes containing 
random mutations. Since these mutations could alter the structure of VSP, correctly- 
folded variants can be selected for by their ability to bind a set of monoclonal antibodies 

25 recognizing different conformational domains of wild-type VSP. Likewise, correctly- 
folded variants can be selected by their abilities to homo-heterodimerize. Correctly- 
folded VSP variants (i.e., those retaining the ability to bind VSP-specific conformational 
antibodies and homo/heterodimerize) can be selected by phage display technology or 
screened using a filter lift assay (see methods). Subsequent isolation and sequencing of 

30 these variants reveals the tolerated mutations. Amino acid substitutions which do not 



RTA01/2072159vl 



- 13- 



Attorney Docket No. 5718-16B 



compromise the VSPP structure may be good candidates for site-directed methionine 
substitutions. 

In addition to this "random" approach, a method for the "semi-rational" 
incorporation of methionines into VSP was developed Although the 3-D structure of 
VSP is uncertain, secondary structure prediction of the protein (see strategy 2 below) 
allowed "semi-rational" methionine substitutions. Analysis of VSPP homology with 
tomato acid phosphatase, a protein with 45% identity to VSPp, as well as other homologs 
allowed additional methionine substitutions (see strategy 3 below). Two methods were 
designed by which to introduce these substitutions. The first method involves DNA 
shuffling in the presence of excess methionine-encoding oligos which, by protein 
secondary structure predictions, are complementary to multiple regions of the VSPJ3 gene 
corresponding to protein loops. The second novel method employed overlap PCR of 
segments of the VSPP gene corresponding to protein loops which have been amplified 
with the methionine-encoding oligos. The methods by which these oligos (corresponding 
to, for example, twenty-two different methionine substitutions) are introduced into VSPP 
result in the production of a library of phage-displayed VSP variants; theoretically each 
variant contains zero to twenty-two additional methionines. Subsequent phage display 
and biopanning of these libraries against VSP-specific monoclonal antibodies can lead to 
the identification of residues in VSP which can accommodate methionine without 
significantly altering the structure of the protein. 

A VSPp mutant library was made by error prone PCR methodology (see below). 
From this pool of mutants, a filter lift assay (see methods) was performed to identify 
properly-folded mutant VSPP based on the ability to bind to either VSPa or a VSP- 
specific monoclonal antibody. Using VSPa as the antigen in a filter lift assay (Fig. 5) 18 
out of 50 VSPP variants tested bound VSPa Sequence analysis of 1 5 of these variants 
revealed a total of 84 point mutations which correlate with 58 AA substitutions and 25 
silent mutations. Together these represent 51 different residues within the 218 AA VSPp. 

2. Secondary structure prediction 

Structural features of a protein are very important for proper folding. Sequence 
analysis tools such as the GCG (Wisconsin Sequence Analysis Package, Genetic 
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Computer Group, University Research Park, 575 Science Drive, Madison, WI) and 
PC/GENE (Oxford Molecular Group, 2105 S. Bascom Avenue, Suite 200, Campbell, 
CA) were used to analyze the VSPp sequence for secondary structure features such as 
helices, sheets and turns and for determining whether a particular stretch of amino acids 

5 might reside on the surface of the protein. Residues on the surface of a protein would 
likely tolerate substitution more readily than a buried residue without compromising the 
structure of the protein. Using these algorithms, numerous predicted turns and surface 
regions of the protein were identified. Many of these regions are expected to tolerate 
methionine substitution. For example residues at positions 25, 30, 32, 37, 44, 65, 67, 

10 102, 121, 130, 160, 163, 164, 169, 198, 202, and 207 in VSPP occur in predicted turn 
regions and were substituted with Met (Table 1). 

3. Homology comparison 

Over time, nature has tested the tolerance of protein residues to substitution, and 

15 this is exemplified in the sequences of proteins such as globins and cytochromes from 
several different species, members of which have the same fold (Hampsey, M.D., Das, 
G., Sherman, F. (1988) FEBS Lett. 231,275; Bashford, D., Chothia, C. & Lesk, A.M. 
(1987) J. Mol. Biol. 196, 199; Lesk, AM. & Chothia, C. (1980) J. MoL Biol 136,225). 
These and other studies have demonstrated that hydrophobic residues (such as Ala, Sys, 

20 Val, He, Leu, Met, Phe and Trp) almost always occur in the hydrophobic core of the 
protein and that they may substitute for each other without undue perturbation of the 
structure (Bowie, J.U., Reidhaar-Olson, J.F., Lim. W.A., & Sauer, RT. (1990) Science 
247, 1306-1310; Baldwin, E.P., & Matthews, B.W. (1994) Curr. Opin. Biotech. 5, 396- 
402). Indeed, it has been observed that "Residue positions that can accept a number of 

25 different side chains, including charged and highly polar residues, are almost certain to be 
on the protein surface. Bowie et al (1990) Science 247: 1306-13 10, have Residue 
positions that remain hydrophobic, whether variable or not, are likely to be buried within 
the structure". Furthermore, in a recent comprehensive analysis of substitution patterns 
in several databases of multiply aligned protein sequences, Ladunga and Smith (1997) 

30 Prot. Eng. 10: 187-196, have concluded that the overall emphasis is on the preservation of 
three dimensional structure of the protein and that residues that substitute for each other 
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in related sequences do so by conserving the physico-chemical properties of the residue 
and the folding of the protein. In the case of VSP, this evolutionary data was utilized by 
comparing the homology of VSPP with six homologous proteins (Fig. 1). Amino acids 
that are critical to the function and/or folding of a protein would be expected to be 

5 conserved over time. For example, cysteine 7 and 29 are conserved in all seven of the 
homologous proteins aligned in Fig. 1. These residues are involved in forming a 
disulfide bond that may be expected to be of importance to the structure of the protein. 
In summary, analysis of the VSPp sequence with its homologs led to the identification of 
3 1 residues (out of 218 amino acids) that in all liklihood will tolerate methionine 

10 substitution. 

Engineering VSPP for increased methionine 
Rational 

Wild-type VSPp contains 1.4% methionine. Using the three strategies described, 

1 5 three different VSPp variants with increasing amounts of methionine have been proposed 
(9.6%, 14.2%, 17.9%, Fig. 2). The overall amino acid composition in each of these 
constructs is presented in Table 2. Construct VSPP-met20 (14.2% Met) contains the 
same 18 Met substitutions as the VSPp-metlO derivative plus an additional 11 Met 
residues. Likewise VSPp-met30 contains the same 29 Met substitutions as VSPp-met20 

20 plus an additional 7 Met residues. Mutational analysis of VSPP resulted in the mutation 
of 51 different amino acids out of the 218 amino acid protein. Although these mutations 
were not methionine substitutions, the types of tolerated substitutions were examined for 
their relevance to substitution to a hydrophobic amino acid. For example, positions 50, 
67, 93, 127, 150, and 164 tolerated mutation to a hydrophobic amino acid (Table 1). 

25 Therefore, it is possible that this same position might tolerate substitution to methionine. 
Positions 62, 67, 76, 127, and 164 are hydrophobic amino acids in VSPP - wild type. The 
observation that these positions tolerate substitution at all suggests they would more 
readily tolerate a conservative substitution (i.e., hydrophobic amino acid to hydrophobic 
amino acid, Table 1). Since residues 32, 50, 65, 67, 76, 93, 127, 150, 160, and 202 

30 allowed non-conservative mutations, it is possible that these positions would tolerate 
mutation to methionine (Table 1). In every case where these amino acids were not 
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changed from or to a hydrophobic amino acid in the mutational analysis, at least one 
additional strategy (i.e., secondary structure or homology comparison) was used to 
rationalize methionine substitution at the particular position. In summary, in the three 
methionine enriched constructs proposed, 12 residues (out of a total of 36) were selected 
5 based at least in part on mutational analysis. More specifically, mutational analysis 
indicated 6/18 methionine substitutions in construct VSPP-metlO, 9/29 in construct 
VSPp-met20, and 12/36 in VSPp-met30 (Table 1). As mentioned, mutational analysis 
revealed 51 different positions within VSPp tolerant to substitutions. Interestingly, 25/51 
(49%) of the mutated positions are located in regions of the protein predicted to exist as 

10 turns, 17/51 (33%) in helices, and 9/51 (18%) in P-sheets. These percentages are 

significantly different from the predicted distribution of turns (25%), helices (25%) and 
P-sheets (50%), indicating that, as expected, the regions of the protein most likely to be 
located on the surface (e.g., turns) can more readily accommodate substitutions without 
compromising the structure of the protein. This suggests the importance of protein 

15 secondary structure prediction as one of the strategies utilized in the identification of 
residues for methionine substitution. 

Since protein turns are generally more surface-exposed regions that do not 
contribute greatly to the overall structure of the protein, these regions were targeted for 
methionine substitution. In fact, out of the 36 positions selected for methionine 

20 substitution, 17 (47.2%) are predicted to occur in turns. In contrast, because p-sheets are 
protein structural elements that generally occur at the core of the protein, these regions 
were avoided in selecting sites for methionine substitution. Out of the 36 positions 
selected for methionine substitution, only 7 (19.4%) are predicted to occur in p-sheets. 
Nearly all of these residues were hydrophobic in wild-type VSPp and were thought to 

25 tolerate methionine based upon the homology comparison strategy. Additionally, 12 
(33.3%) of the residues selected for methionine substitution in the three constructs are 
predicted to occur in helices. In summary, secondary structure prediction is the strategy 
responsible, at least in part, for 17/36 sites targeted for methionine substitution. More 
specifically, secondary structure prediction correlates with the selection of 7/18, 14/29, 

30 and 17/36 amino acids for methionine substitution in constructs VSPp-metlO, VSPP- 
met20, and VSPP-met30 5 respectively (Table 1). 
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Homology comparison was a very informative strategy in selecting residues that 
might tolerate methionine substitution. Accordingly, methionine substitutions in VSPp 
were made by adhering to the following rules and also summarized in Table 1 : 

(a) Conserved residues (highlighted in blue in Fig. 1) were defined as those 
5 residues occurring in more than 5 of the 7 homologs. These were not targeted for 
substitution. The exceptions were: at residue numbers 19, 37, 146 and 179 (one of the 
homologs contained a methionine residue); at positions 67, 80, 130 and 169 (conserved 
hydrophobic amino acid exchanges observed in at least one sequence) and at position 50 
(non-conservative changes from Asn to Ser/Cys in two sequences). 

10 (b) Similarly, non-conserved positions were defined as those containing 

residues with different side-chain properties. Several positions in VSPp were correlated 
with non-conservative amino acids in the homologs (e.g., 5, 19, 25, 30, 37, 44, 60, 62, 65, 
67, 72, 76, 80, 90, 97, 102, 121, 127, 130, 135, 142, 146, 150, 164, 169, 179, 189, 198, 
202, 207, and 217). Such residues likely reside on the surface/turns of the protein and 

15 were considered less important for protein function and/or folding and therefore targeted 
for substitution with methionine. 

(c) In addition, some positions in which at least one other hydrophobic amino 
acid was observed among homologs (e.g., 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13, 14, 15, 16, 
17, 18, 19, 20, 21, 25, 30, 37, 44, 60, 62, 65, 67, 72, 76, 90, and 97) were also expected to 

20 tolerate substitution to the hydrophobic amino acid methionine. Exceptions to this were 
cases in which the hydrophobic amino acid was completely conserved in all 6 homologs 
(e.g., Val 49, Leu 77, Leu 1 10, Leu 1 14, Leu 145, He 157, Leu 158, He 186, Val 187, Leu 
197 and Leu 210). In these cases, the possibility that the specific hydrophobic amino 
acid in the wild-type protein may be playing a role critical for the proper structure and/or 

25 function of the protein was considered. To avoid disturbing this possible role, the 

substitution of any residue that is completely conserved in all 6 homologs examined was 
not proposed. 

(d) Six residues within VSPP that were expected to tolerate methionine 
substitution were identified based on the presence of methionine in analogous positions in 

30 homologs (e.g., 19, 37, 44, 146, 179, and 202). 
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A few additional considerations were observed in selecting amino acids that 
might tolerate methionine substitution. 

(e) We avoided altering histidine residues due to their potential importance in 
phosphatase activity of VSPp (Table 2 and DeWald, D.B., Mason, H.S., & Mullet, IE. 

5 (1992) J. Biol. Chem. 267, 15958-15964). 

(f) Since VSPp is a glycoprotein, this feature may be important for the 
stability and/or function of the protein, substitution of potential glycosylation sites was 
avoided (e.g., Asn 94). 

(g) In addition, wherever possible, charged residues such as Lys, Arg, Glx, 
10 Asx were left untouched to preserve the hydrophobic/hydrophilic balance of the protein 

(Table 2 and Fig. 3 A-D). While wild-type VSPp has a calculated charge of -4, VSPp- 
metlO, VSPP-met20, and VSPP-met30 have calculated charges of -7,-7, and -5, 
respectively. 

As a strategy, homology comparison facilitated, at least in part, the selection of 
15 3 1/36 of the residues proposed for methionine substitution. These selections correlate 
with 18/18, 28/29, and 31/36 residues for constructs VSPp-metlO, VSPP-met20, and 
VSPp-met30, respectively (Table 1). 

Several of the amino acids selected for methionine substitution in the three 
constructs resulted from more than one strategy. In fact, the majority (20/36) of the 
20 targeted residues resulted from at least two strategies, with a few (4/36) resulting from all 
three strategies. 

Experimental results 

A synthetic gene for methionine enriched VSPP-metlO has been constructed. 

25 This synthetic gene differs from wild-type VSPp in that it encodes eighteen additional 
methionines (Fig. 4). Also, a few silent point mutations were introduced into this 
construct to create unique restriction sites. To test whether the proposed VSPP-metlO 
gene was correctly folded, the construct was cloned into the phagemid vector 
pCANTAB-5E and the abilities of the expressed proteins to bind VSP-specific 

30 conformational monoclonal antibodies in a filter lift assay were compared. The results 
indicate that the VSPP-metlO gene was able to bind the same antibodies as wild-type 
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VSPp. This suggests that VSPp-metlO may be correctly folded in an E. coli secretion 
system. 

Together, these interdisciplinary approaches should not only result in the 
engineering of a nutritionally-enhanced VSP, but also provide clues to the structure of 
5 VSP - a protein for which no 3D structure is available. This approach is applicable to any 
protein of interest. 

Methods 

1 . Random mutation of vegetative storage protein (VSPp) by error-prone PCR 
10 The VSPp gene was amplified by mutagenic PCR using primers flanking the gene. 



iveacxion i 


jveaciion z 


Jtve dc Lion j 


xvcdulluil \ 


10mM Tris-HCl 


lOmM Tris-HCl 


10mM Tris-HCl 


lOmM Tris-HCl 


50mM KC1 


50mM KC1 


50mM KC1 


50mM KC1 


9.5mMMgC12 


9.mM mgC12 


9 mM mgC12 


9.mM mgC12 


0.5mMMnC12 


0.5mMMnC12 


0.5mMMnC12 


0.5mMMnC12 


5 ng/ml BSA 


5 ng/ml BSA 


5 ng/ml BSA 


5 ng/mlBSA 


600pmol VSP 


600pmol VSP 


600pmol VSP 


600pmol VSP 


template 


template 


template 


template 


0. 1 (jjm each 


0.1 nmeach 


0. 1 urn each 


0.1 [im each 


primer 


primer 


primer 


primer 


2mM dATP 


200pM dATP 


200pM dATP 


200nM dATP 


200nM dCTP 


2mM dCTP 


200uM dCTP 


200pM dCTP 


200(iM dGTP 


200^ dGTP 


2mM dGTP 


200mM dGTP 


200nM dTTP 


200nM dTTP 


200|jM dTTP 


2mM dTTP 


2 Units Taq Pol 


2 Units Taq Po 


2 Units Taq Pol 


2 Units Taq Pol 



1 cycle (1 min. at 95°C, 1 min. at 51°C, 3 min. at 72°C) 
16 cycles (1 min. at 91 °C, 1 min. at 51°C, 3 min. at 72°C) 
15 1 cycle (1 min. a 91°C, 1 min. at 51°C, 5 min. at 72°C) 
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The products of these four reactions were pooled, and the band corresponding to 
the mutagenized VSPp gene was purified from an agarose gel, digested with Sfil and 
NotI and cloned into the phagemid vector pCANTAB-5E. 



5 2. Filter lift assay 

Fifty E. coli colonies containing randomly mutated VSPp genes were picked as 
small patches to an SB agar plate containing glucose and ampicillin. Patches were 
allowed to grow overnight at 37 DC and were then transferred to a nitrocellulose filter. 
On the surface of an SB agar plate containing ampicillin and IPTG, this filter was placed 

10 on top (cell-side up) of a separate blocked filter to which the antigen (e.g., VSPa) had 
been coated. During an overnight incubation at 30 DC, the cells expressed the VSPp 
variant they encoded. These proteins were able to diffuse through the top filter and, if 
correctly folded, bind the antigen-coated filter below. The next day, the antigen-coated 
filter was washed with PBS-0.05% tween and incubated with HRP/anti-e tag conjugate. 

15 Since the VSPp mutants are cloned into the pCANTAB-5E vector which fuses a C- 

terminal epitope tag (e-tag) to the VSPp protein variants, bound proteins were detected by 
this antibody in combination with enhanced chemiluminescence detection. 
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Table 1 



Proposed Methionine Substitutions 



Homology Comparison 



VSPp 
position 




Mutational 
Analysis 1 


Secondary 
Structure 2 


Original A. A. 
hyrophoibic? 


Met in 
homolog? 3 
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hydrophobic 4 


ComstrmWl^ 


w& 
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Y 


- 
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Y 


- 


3 of 6 
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I 


- 
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Y 


Y-Ar. VSP 


6 of 6 


30 


V 
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T 


Y 


- 


2 of 6 


37 


I 


- 


T 


Y 


Y-T. phos 


6 of 6 


44 


I 
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Y 


Y-T. phos 


lof6 


60 


R 


- 


- 


- 


- 


5 of 6 


62 


V 
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- 


Y 


- 


6 of 6 
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I 


I-T,L 


T 


Y 


- 


5 of 6 
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I 


_ 


- 


Y 


- 


6 of 6 


76 


V 
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_ 


Y 


_ 


5 of 6 
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L 
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Y 


_ 
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_ 


Y 


_ 


3 of 6 
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_ 
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Y 
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Y 


Y-T. phos 
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Y 


_ 
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202 


R 


R-G,T 
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_ 
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Y 
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Y 


_ 
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Y 


_ 


6 of 6 
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_ 
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L 




T 
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80 
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93 
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E 










3 of 6 


160 


D 


D-Y 


T 






Oof 6 
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L 




T 


Y 
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1 Amino acid substitution observed in the mutational analysis. For example, at position 62, a valine to 
alanine substitution was observed. 

2 "T" indicates turn predicted by secondary structure analysis of VSPp. 

3 "Y" indicates the presence of Methionine in the designated VSP homolog. 

4 Includes only aliphatic hydrophobic amino acids such as Leu, lie, Val, and Met. 
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Table 2 

Amino Acid Composition of VSPP-WT and Methionine-Enriched Variants 





VSP3 


VSPp-MetlO 


VSPp-Met20 


VSP3-Met30 


Ala 


13 


13 


13 


13 


Arg 


11 


9 


9 


9 


Asn 


14 


14 


13 


12 


Asp 


11 


11 


11 


10 


Cys 


2 


2 


2 


2 


Gin 


6 


6 


6 


6 


Glu 


19 


19 


19 


18 


Gly 


13 


13 


13 


13 


His 


7 


7 


7 


7 


He 


14 


6 


6 


4 


Leu 


20 


18 


13 


12 


Lys 


15 


14 


14 


14 


MET 


3 (1.4%) 


21 (9.6%) 


32 (14.7%) 


39 (17.9%) 


Phe 


12 


12 


11 


10 


Pro 


9 


9 


8 


8 


Ser 


13 


12 


12 


12 


Thr 


10 


10 


9 


9 


Trp 


3 


3 


3 


3 


Tyr 


12 


12 


12 


12 


Val 


11 


7 


5 


5 




Total 


218 


218 


218 


218 



All publications and patent applications mentioned in the specification are 
indicative of the level of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same 
extent as if each individual publication or patent application was specifically and 
1 0 individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious that 
certain changes and modifications may be practiced within the scope of the appended 
claims. 
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THAT WHICH IS CLAIMED: 



1 . A Nucleic acid molecule encoding an engineered protein having altered amino 
acid composition, wherein said amino acid composition has been altered by introducing 

5 amino acid changes into said protein, wherein said engineered protein binds to an 
interacting molecule capable of binding with a corresponding native protein. 

2. The nucleotide sequence of claim 1, wherein said amino acid changes increase the 
levels of at least one essential amino acid in the protein. 

10 

3. The nucleotide sequence encoding of claim 2 wherein said essential amino acid is 
selected from the group consisting of methionine, tryptophan, lysine, valine, 
phenylalanine, isoleucine, leucine, theronine and cysteine. 

15 4. The nucleotide sequence of claim 1, wherein said is vegetative storage protein. 

5. A transformed plant containing within its genome the nucleotide sequence of 
Claim 2. 

20 6. A transformed plant containing within its genome the nucleotide sequence of 
Claim 3. 

7. A transformed plant containing within its genome the nucleotide sequence of 
Claim 4. 

25 

8. A stably transformed plant having inserted into its genome a chimeric gene said 
gene encoding an engineered protein having altered amino acid composition wherein said 
protein, wherein said engineered protein binds to an interacting molecule which binds 
with a corresponding native protein. 

30 
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9. The plant of Claim 8, wherein said amino acid changes increase the levels of at 
least one essential amino acid in the protein. 



10. The plant of Claim 9, wherein said essential amino acid is selected from the group 
5 consisting of methionine, tryptophan, lysine, valine, phenylalanine, isoleucine, leucine, 

theronine and cysteine. 

1 1 . The plant of Claim 10, wherein said essential amino acid is increased to represent 
5% of the total amino acid content of the protein. 

10 

12. The plant of Claim 10, wherein said essential amino acid are increased to 
represent 10% of the total amino acid content of the protein. 

13. The plant of claim 12, wherein said protein is vegetative storage protein. 

15 

14. The plant of Claim 8, wherein said plant is a dicot. 

15. The plant of Claim 8, wherein said plant is a monocot. 
20 16. The plant of Claim 15, wherein said monocot is maize. 

17. The plant of Claim 15, wherein said dicot is soybean. 

18. Seed of the plant of Claim 8. 

25 

19. Seed of the plant of Claim 15. 

20. Seed of the plant of Claim 16. 

30 
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COMPOSITIONS AND METHODS FOR ENHANCING 
THE NUTRITIONAL VALUE OF PROTEINS 



ABSTRACT OF THE DISCLOSURE 
5 Methods and compositions for altering amino acid composition of a protein of 

interest are provided, particularly proteins whose three-dimensional structure is unknown. 
The method comprises creating interacting molecules to the native protein and selecting 
for engineered proteins which retain the native conformation by antibody binding. In this 
manner, the levels of essential amino acids in a protein can be increased yet the biological 
1 0 activity of the protein maintained. 
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VSP homologies (Fig.l) 
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Figure 2 



PROPOSED VSPP METHIONINE-ENRICHED VARIANTS 
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Fig. 3A 

Hydropathy index computation for sequence VSPB- 



Total number of amino acids Is: 213, 
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hydropathic index of USPS from amino acid 1 to amino acid 218. 
Computed using an interval of 9 amino acids. (GMftf - -4.95). 



Fig. 3B 

Hydropathy index computation for sequence VSPM10 

Total number of amino acids is: 218. 
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Hydropathic index of USPN1 from amino acid 1 to amino acid 218. 
Computed using an interval of 9 amino acids. (GRftUV ~ -5.52). 
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Hydropathy index computation for sequence VSPM20. 



Total number of amino acids is: 218. 
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Hydropathic index of USPH28 from amino acid 1 to amino acid 218. 
Computed using an interval of 9 amino acids. (GMflf - -5.68). 



Fig. 3D 



Hydropathy index computation for sequence VSPM30. 



Total number of amino acids Is: 218, 




Hydropathic index of USPM38 from amino acid 1 to amino acid 218. 
Computed using an interval of 9 amino acids. (GRAU¥ - -5.31) . 



Figure 4 
VSPP-metlO sequence 

Sfil 

1 GGCCCAGCCGGCCAGATCTTCGGAGATGAAATGCGCTAGCTTTAGGCTTGCTGTGGAAGC 60 

CCGGGTCGGCCGGTCTAGAAGCCTCTACTTTACGCGATCGAAATCCGAACGACACCTTCG 

61 ACACAACATGCGAGCCTTTAAAACCATTCCTGAAGAGTGCATGGAACCAACAAAGGACTA 120 

TGTGTTGTACGCTCGGAAATTTTGGTAAGGACTTCTCACGTACCTTGGTTGTTTCCTGAT 

121 CATGAATGGCGAACAATTTCGAATGGACTCTAAAACAGTTAACCAACAGGCCTTCTTTTA 180 
GTACTTACCGCTTGTTAAAGCTTACCTGAGATTTTGTCAATTGGTTGTCCGGAAGAAAAT 

181 TGCTAGTGAAATGGAAATGCATCACAACGACATGTTTATATTCGGCATGGATAACACCAT 240 
ACGATCACTTTACCTTTACGTAGTGTTGCTGTACAAATATAAGCCGTACCTATTGTGGTA 

241 GCTCTCTAATATCCCATACTATGAAAAACATGGATATGGGGTGGAGGAATTTAATGAAAC 300 
CGAGAGATTATAGGGTATGATACTTTTTGTAC CTATACCCCACCT CCTTAAATTACTTTG 

301 CTTATATGATGAATGGGTTAACAAGGGCGACGCACCGGCATTGCCAGAGACTCTTAAAAA 360 
GAATATACTACTTACCCAATTGTTCCCGCTGCGTGGCCGTAACGGTCTCTGAGAATTTTT 

361 TTACAACAAGCTGATGTCCCTTGGCTTCAAGATGGTATTCTTGTCAGGAAGGTACCTTGA 420 
AATGTTGTTCGACTACAGGGAACCGAAGTTCTACCATAAGAACAGTCCTTCCATGGAACT 

421 CAAAATGGCCGTAACAGAAGCAAAC CTAATGAAGGCTGGCTTC CACACATGGGAGC AGTT 480 
GTTTTACCGGCATTGTCTTCGTTTGGATTACTTCCGACCGAAGGTGTGTACCCTCGTCAA 

481 AATTCTCAAGGATCCACATCTTATGACTCCAAATGCACTTTCATACAAATCAGCAATGAG 540 
TTAAGAGTTCCTAGGTGTAGAATACTGAGGTTTACGTGAAAGTATGTTTAGTCGTTACTC 

541 AGAGAATATGTTGAGGCAGGGATACAGAATTGTTGGAATGATTGGTGATCAATGGAGCGA 600 
TCTCTTATACAACTC CGTC CCTATGTCTTAACAACCTTACTAACCACTAGTTACCTCGCT 

601 TCTGCTTGGAGACCACATGGGCGAATCTAGAACCTTTAAGCTTCCTAATCCCATGTACTA 660 
AGACGAACCTCTGGTGTACCCGCTTAGATCTTGGAAATTCGAAGGATTAGGGTACATGAT 



661 CATGGAGGCGGCCGC 675 
GTACCTCCGCCGGCG 

Not! 



Figure 5 

Colony lift assay to detect protein-protein interactions 




< — colonies on master filter 
< — VSPcc-coated filter 
< — SB + amp + IPTG 



Layer antigen (VSPa)-coated filter and 
colony lift filter on SB-IPTG-plate 




express VSPp mutants at 30°C 



master filter of colonies 
containing VSPp mutants 
cloned into phagemid vecto 



Correctly-folded VSPp variants diffuse through 
the master filter and bind to the VSPa-coated filter 



wash filter 



VSPa-coated filter is incubated with 
HRP/anti-e tag conjugate 




developed VSPa- 
coated filter 



develop filter with substrate (ECL) 
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